The Capacity Region of Multiway Relay Channels 
Over Finite Fields with Full Data Exchange 

Lawrence Ong, Sarah J. Johnson, and Christopher M. Kellett 



Abstract — The multi-way relay channel is a multicast network 
where L users exchange data through a relay. In this paper, 
the capacity region of a class of multi-way relay channels is 
derived, where the channel inputs and outputs take values over 
finite fields. The cut-set upper bound to the capacity region 
is derived and is shown to be achievable by our proposed 
functional-decode-forward coding strategy. More specifically, for 
the general case where the users can transmit at possibly different 
rates, functional-decode-forward, combined with rate splitting 
and joint source-channel decoding, is proved to achieve the 
capacity region; while for the case where all users transmit at a 
common rate, rate splitting and joint source-channel decoding are 
not required to achieve the capacity. That the capacity-achieving 
coding strategies do not utilize the users' received signals in the 
users' encoding functions implies that feedback does not increase 
the capacity region of this class of multi-way relay channels. 



I. Introduction 

We consider the multi-way relay channel (MWRC), where 
L users (L > 2) exchange data via a relay. Each user is 
to send its data to all other users. We further consider the 
case where there is no direct link among the users. So, 
information exchange among the users can only be done 
through the relay. Common applications of this model include 
conference calls in the cellular network where mobile users 
communicate among themselves through a base station, and 
satelUte communications (see Fig. [T]). 

The MWRC is an extension of the two-way relay channel 
(TWRC) where two users exchange data via a relay (e.g., 
see |[T|-|[3l). As the TWRC embeds a relay channel, coding 
strategies designed for the relay channel were modified and 
attempted on the TWRC. These include: 

• CompIete-decode-forwarJ^ (CDF): The relay completely 
decodes the users' messages, and broadcasts them back 
to the users (see Hl-llll). 

• Compress-forward: The relay quantizes its received sig- 
nals, re-encodes and broadcasts them to the users (see 

El, H). 

• Amplify-forward: The relay simply scales and forwards 
what it receives (see ID-Ill). When applied to the 
Gaussian TWRC, this strategy is also known as analog 
network coding jS]. 

• Combinations of the above strategies (see ||6l, Q). 

• A combination of partial-decode-forward and compress- 
forward (see ISl). 

'This strategy is commonly refeiTed to as decode-forward or decode-and- 
forward. We refer to this strategy as complete-decode-forward to differentiate 
it from our proposed functional-decode-forward 




Fig. 1 . An application of the MWRC, where stations exchange information 
via a satellite 

CDF, compress-forward, and amplify-forward coding strate- 
gies for the TWRC have been extended to the Gaussian 
MWRC by Gundiiz et al. ||9l- However, none of these strate- 
gies achieve the capacity region of the MWRC in general. 

A. Functional-Decode-Forward 

Recently, functional-decode-forward (FDF) has been pro- 
posed for the TWRC, where the relay decodes a function of the 
two users' messages and broadcasts the function back to the 
users lfT0l - lfT4l . Obviously, the function must be defined such 
that each user can decode the message of the other user from 
the function and its own message. FDF was shown to achieve: 
(i) the capacity region of the binary TWRC [lOj, where the 
channels are binary symmetric, and (ii) within i bit of the 
capacity region of the Gaussian TWRC ifTSl . Linear codes are 
used in FDF for the binary channel, and lattice codes ifTSl are 
used in FDF for the Gaussian channel. FDF for the Gaussian 
TWRC was extended to the multi-pair Gaussian TWRC (where 
multiple source-destination pairs exchange data via one relay) 
by Gundiiz et al. 

In the TWRC and the multi-pair TWRC, FDF was designed 
for pair-wise data exchange. We later proposed FDF for the 
MWRC (a non-trivial extension of FDF for the TWRC) where 
multiple users exchange data via a relay at a common rate, 
and showed that FDF achieves the common-rate capacity of 
the binary MWRC |[T6l . Applying insights from the binary 
MWRC has allowed us to obtain the common-rate capacity 
of the Gaussian MWRC with three or more users where all 
nodes transmit at the same power ifTTl . 

In this paper, we extend our proposed FDF for the common- 
rate binary MWRC QH to the general-rate MWRC over a 
finite field where the channel inputs and outputs take values 
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over a finite field and where the users can possibly transmit at 
different rates. Furthermore, unlike ||9l, IfTSl . we consider the 
more general unrestricted MWRC where each user's encoding 
function at any time can depend on its own message and its 
previously received signals. Note that the binary MWRC is a 
special case of the MWRC over a finite field. 

On the uplink (the channel from the users to the relay), we 
use functional decoding combined with rate splitting. Similar 
to lfT6l . linear codes are used here. The main idea behind 
this generalization (from the binary channel to the finite field 
channel) relies on the fact that optimal (capacity-achieving) 
hnear codes can be constructed for channels over finite fields. 
Using linear codes on the uplink, the relay is able to decode 
a function of the users' codewords, which is also a codeword 
from the linear code. On the downlink (the channel from the 
relay to the users), the relay needs to send different messages 
to different users, and so the coding technique for broadcast 
channels with receiver side information developed by Tuncel 
ifTSl is used, which utilizes joint source-channel decoding. We 
show that the combination of FDF, rate splitting, and joint 
source-channel decoding achieves the capacity region of the 
MWRC over a finite fielcjl 

We shall see later that using the capacity-achieving FDF, 
the users' transmitted signals only depend on their respective 
messages and do not depend on their received signals. This 
means utilizing feedback at the users does not increase the 
capacity region of the MWRC over a finite field. 

This, to the best of our knowledge, is the first example 
of an MWRC where the capacity region is found for all 
noise distributions/levels. The optimal coding strategy for the 
MWRC over a finite field proposed in this paper gives insights 
into optimal processing/coding strategies for other classes of 
MWRCs. This work suggests that for the general MWRC, 
functional decoding should be performed at the relay, and joint 
source-channel decoding at the users. 

On the uplink of MWRCs, the relay receives interfering 
signals from all the users (see (fTbli). Such networks, where 
some node(s) receives a function (which can be noisy) of more 
than one other node's transmission, are usually referred to as 
networks with interference. Using our proposed FDF, up to 
two users are allowed to transmit at any time, and the relay 
attempts to decode a function of the users' messages. Rather 
than avoiding interference, this coding strategy embraces it and 
can thus be viewed as a form of interference alignment lfT9l . 

Remark 1: Note that linear codes are also used in other 
types of networks, including the multicast (one source sending 
data to multiple destinations) network with interference ll20l - 
ll22l . the multiple-access channel where the destination is 
to decode a linear combination of the sources' messages 
II20I . ||2T1 . and the multi-source multicast network with no 
interference 12311 . Linear codes have been shown to be optimal 
(capacity-achieving) in these networks when the channels are 
themselves linear Note that the MWRC is not a special case 
of these networks as it has multiple sources and multiple desti- 
nations, and it incorporates interference in its network model. 

-Note that rate splitting and joint source-channel decoding were not 
required for the common-rate case in 1161 . 



Furthermore, the coding strategy for the uplink developed in 
this paper is different from existing strategies. 

B. Other Related Work 

A channel model similar to the finite field channel consid- 
ered in this paper is the deterministic (noiseless) channel. In 
the deterministic model, the channel output is the arithmetic 
summation of the bit-shifted channel inputs, and there is no 
noise. The deterministic model has been used to construct 
coding strategies and to gain insights for more general chan- 
nels. This approach has been applied to the multiple-access 
channel 1241 . the broadcast channel ||24| . the interference 
channel El, EH, the deterministic TWRC 123, and the 
deterministic multi-pair TWRC l28l . For the deterministic 
TWRC and the deterministic multi-pair TWRC, it has been 
shown that linear coding achieves the capacities, an observa- 
tion similar to that in this paper for the finite field MWRC. 

The MWRC we consider herein, where each user is to 
decode the messages from all other users, can be seen as 
a generalization of the TWRC. Different extensions of the 
TWRC include: 

• The multi-pair TWRC where multiple source-destination 
pairs exchange messages via one relay l28l . l29ll . Here, 
each destination only decodes the message from one 
source. 

• The multi-pair TWRC where multiple users exchange 
messages with a base station via a relay f30l. Here, each 
user sends its message to the base station, and the base 
station sends different messages to each user 

« The TWRC with additional private messages from the 
users to the relay l3l1. l32l. 

• The MWRC where the users are separated into different 
groups and all users in each group exchange messages 
among themselves [9l. 

The MWRC has also been studied from the point of view 
of source coding, where multiple users exchange possibly 
correlated data via a relay. In the source coding setting, 
the channel from the users to the relay and that from the 
relay to the users are assumed to be noiseless. The problem 
formulation is how many bits the users need to encode their 
respective messages to be sent to the relay; and after the relay 
receives these encoded messages, how many bits the relay 
needs to transmit to the users in order for each user to recover 
the messages of all other users. The three-user lossless case 
(where each user perfectly reconstructs the other two users' 
messages) was studied by Wyner et al. l33l . the two-user 
lossless case and lossy case (where each user reconstructs the 
other user's message with a prescribed distortion) was studied 
by Su and El Gamal l34| , and the two-user lossy case with 
common reconstructions (where each user must also be able to 
determine the lossy reconstructed message of the other user) 
was studied by Timo et al. . 

C. Organization 

The rest of the paper is organized as follows. In Sec. Ull 
we describe the MWRC over a finite field, define the notation 
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Fig. 2. The L-user MWRC over a finite field _ T 

associated addition ffi and multiplication 0, where 

. . . , Vf/i,i_i, . . . , Wi.L) is user i's estimate of 

other users' messages 
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used in this paper, and quote a few lemmas that will be used 
in the later sections. We derive upper bounds to the capacity 
region and the common-rate capacity of the MWRC over a 
finite field in Sec. |III] We then construct linear codes over 
finite fields in Sec. IIVI which facilitate functional decoding 
at the relay. We derive the capacity region of the finite field 
MWRC in Sec. |V] In Sec. |Vll we use the two-user binary 
MWRC as an example to analyze why neither CDF nor FDF 
with separate source-channel decoding achieves the capacity 
region of the MWRC in general. Sec. IVIII concludes the paper. 

II. Channel Model 

Fig. 12] depicts the i-user MWRC considered in this paper, 
where there is no direct user-to-user link. Nodes 1, 2, . . . , i 
are the users, and node the relay. By definition, L > 2, and 
each user is to decode the messages from all other users, i.e., 
the users perform /mW data exchange. We denote by Xi node 
i's input to the channel, Yi the channel output received by node 
i, and Wi node i's message. We assume that the messages 
are independent. We consider a full-duplex and causal relay, 
meaning that the relay can transmit and receive at the same 
time, and that the transmit signal of the relay at any time can 
only depend on its past received signals. 

Definition 1: We define the L-user MWRC over a finite 
field T (with associated addition 0, multiplication 0, and the 
additive identity o G as follows: 

• The uplink channel is the weighted sum of all users' 
channel inputs and the relay's receiver noise: 



- {hi^o Q Xi) ® ih2,o & X2) 



(la) 

©(/il,o0^l) 
(lb) 



where Xi,NQ,YQ e J-, /i^.o ^ J' \ {0}, Vi, and A^o is 
the receiver noise and is an independent and identically 



distributed (i.i.d.) random variable for each channel use. 
The parameters hi^, Vi, are fixed and are known to all 
the nodes a priori. Recall that is a field if and only 
if |J^| = £^ for some prime number £ and some positive 
integer z. 

m The downlink consists of independent channels from the 
relay to the users: 

= (ho,, Q Xo) (B N,, Vie {1,2,..., L}, (2) 

where Xq, N,, Y, e T, hoA <^ {0}, Vi, and A^^ is the 
receiver noise at node i and is an i.i.d. random variable for 
each channel use and for each user i. Each ho^i is fixed 
for all channel uses and is known to node i a priori. 

Remark 2: The MWRC over a finite field is defined to 
resemble the wireless additive white Gaussian noise channel 
where the channel output is the sum of attenuated (usually as 
a result of path loss, which is inversely proportional to the 
node distances) channel inputs and noise. However, addition 
and multiplication over a field do not bear the same practical 
implication as those over real numbers. 

Let Xi[t] and Yi[t] denote the transmitted signal and the 
received signal of user i respectively on the t-th channel 
use. We consider the following block code of n simultaneous 
uplink and downlink channel uses, meaning that the relay and 
all users transmit Xi[t] respectively and simultaneously, for 
t e {1,2,. 

Definition 2: A (2"^i , 2"^^ , . . . , 2"^^ , n) code for the 
MWRC consists of 

1) L messages, one for each user: Wi £ Wi = 
{l,...,2"-«'}, for i e {l,2,...,i}. We denote by 

= {Wi, W2, . . ■ , Wl) the message tuple. 

2) L sets of user encoding functions, one set for each 
user: /^.t : Wi x J"*^^ J", such that Xi[t\ = 

At(w„r4i],y,[2],...,r,[t-i]),forie {i,2,...,L}, 

t S {1,2,..., n). This means that the transmit signal of 
a user at any time can depend on its message and its 
previously received signals. 

3) A set of relay encoding functions: /o,t : J^*^^ — > 
T, for t e {l,2,...,n}, such that Xo[t] = 
/o,t(il3[l], ^o[2], ■ • ■ ,yo[i— 1]). This means the transmit 
signal of the relay at any time can only depend on its 
previously received signals. 

4) L user decoding functions, one for each user: : 
J"" X >V, ^ Wl X • • • X Wi-i X Wi+i X • • • X Wl, such 

that ^ im,i,...,w,,,.um.,+i,...,m,L) = 

QiiY i, Wi), for I € {1, 2, ... , L}, where Wij is node i's 
estimate of Wj, and Y, = (F^l], K,[2], . . . [Yi[n]). This 
means each user decodes the messages sent by all other 
users based on its n received signals and the knowledge 
of its own message. 

Note that the source message Wi, which is an nRi-b\i 
message, is sent from user i to all other nodes (through 
the relay) in n channel uses, giving a rate of 2^ = Ji^ 
bits/channel use. We say that user i transmits at the rate Ri 
bits/channel use. 

In this paper, bold letters are used to denote collections of 
variables across time, e.g., X = {X[l], X[2], ...,X[k]), for 
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some integer k > 1. The length of the vector will be explicitly 
mentioned when it is not clear from the context. For a random 
variable X, we use the corresponding lower case x to denote 
its realization. 

Definition 3: Assuming that the message tuple ft = 
{Wi, W2, . . . , Wl) is uniformly distributed over the product 
set W = WixW2 X • ■ • X Wl, the average error probability 
for the (2"^i , 2"^^ , . . . , 2"^^ , n) code is defined as 



Pr I W;,j ^ Wj, for some j G {1, 2, , 
and some i e {1, 2, . . . , i} \ j| 




^ " wean 



17 =1 



(3a) 



(3b) 



, Wi) is defined as lo 



where = (wi, . . . , tUi.i, 
without the i-th entry. 

Definition 4: A rate tuple (Ri, . . . , Rl) is said to 
be achievable if, for any e > 0, there is at least one 
(2"^i , 2"^2 , . . . , 2"^^ , n) code such that < e. 

We say that a node can reliably decode a message if and 
only if the average probability that the node wrongly decodes 
the message can be made arbitrarily small. Hence, the rate 
tuple i?2, • ■ • , Rl) is achievable if each user can reliably 
decode the messages from all other users. 

Definition 5: The capacity region C is defined as the closure 
of all achievable rate tuples. 

In this paper, we also consider the common-rate case (a 
special case) where all users transmit at R = Ri, Vi G 
{1,2,..., L}. We say that the common rate R is achievable if 
the rate tuple (i?, R, . . . , R) is achievable. The common-rate 
capacity can be similarly defined: 

Definition 6: We define the common-rate capacity (also 
known as the symmetrical capacity iQ) as 



C = sup{R : (i?, R, . . . , R) is achievable}. 



(4) 



The common rate is useful in systems where all users have 
the same amount of information to send, or in fair systems 
where every user is to be given the same guaranteed uplink 
bandwidth, i.e., each user can send data up to a certain rate, 
at which all other users are able to decode. 

To simplify equations in this paper, we define 



-Rmin = min i?,- 
ie{i,2,...,L} 



(5) 
(6) 

(7) 



For a random variable X G X, H{X) = 
— Y^^^x P(^) P{^) is the entropy of X. We denote 
the uniform distribution of X by p"(a;). 




A. Existing Results 

In this section, we quote existing results that will be used 
in the later sections in this paper. 

First, for a finite field with associated operations of 
addition ©, multiplication 0, and the additive identity £ 7^, 
we have the following lemma due to Jelinek 1361 Lemma 9.3]: 

Lemma 1: Consider a finite field J^. We have the following 

1) the equation a(B x ~ b (where x is the unknown) has a 
unique solution in J^, 

2) for each a € F, the set {a © a; : a; € F} is equal to F. 

3) the equation cQy = d (where y is the unknown) has a 
unique solution in F provided c 7^ 0. 

4) for each c E F \ {0}, the set {c y : y G F} is equal 
to F. 

In this paper, we prove achievability and capacity results 
based on the properties of the set of jointly (5-typical se- 
quences, which is defined as follows: 

Definition 7: The jointly (5-typical set .4|^yj^ with respect 
to a distribution p{x, y) on X x y is the set of sequences 



{x,y) = {{xi, 
that 



yi), {X2,y2), ■ • ■ , iXn,yn)) G A"" X J^" such 



log2 piy) 



H{X) 
-H{Y) 



■log^p{x,y)-H{X,Y) 



< 6 



< 6 



<6, 



(8) 



(9) 



(10) 



■\XY]S 



where p{x, y) = n"=i Pi^iiyi)- The sequences in A^^ 
called jointly (5-typical sequences. 

The jointly (5-typical set has the following properties (taken 
from lEl pages 196-197]): 

Lemma 2: Let 



(X, Y) = {X2,y2), . . . , Y^)), 



(11) 



where {Xi,Yi) are i.i.d. drawn according to p{x,y). The 
following holds for sufficiently large n: 



Pr{(X,y)G^[W]4>l 



s. 



(12) 



Lemma 3: Let {X,Y) = ((Xi, Yi ),..., (X„, F„)) where 
{Xi,Yi) are i.i.d. drawn according to p{x)p{y) (where p{x) 
and p{y) are the marginal probability distribution functions of 
p{x, y)). Then, 

PT\^iX,Y) eAfxY]s} < 2-"(^(^'^)"35)^ (13) 

Next, we have the following theorem due to Tuncel ifTSl for 
the broadcast channel with receiver side information. 

Theorem 1: Consider a broadcast channel 
p{yi:y2, ■ ■ ■ ,yL\xo) where node is the source and 
nodes 1, 2, . . . , L are receivers. Node is to send a message 
U = (L/(^),C/(2) ...,C/("^^)) to all the receivers, and each 



receiver i has side information S 
a priori. Each ([/(^), 5*"', 5*"', . . 



(oW c(2) 



is i.i.d. according 
to p{u, si, §2, . . . , sl), for all v G {1, 2, . . . , Ug}. The source 
transmits Xo{U) as a function of [/ in n channel uses. 
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Each receiver i can reliably decode U, from its n received 
channel outputs Yi and its side information Si, if Hg and n 
are sufficiently large and if 



H{U\S,) < —liXo; Y,), e {1, 2, . . . , L}, 



(14) 



for some p{xo)- 

To show achievability in the Theorem [T] joint source- 
channel decoding is utilized in the sense that each receiver 
uses its side information in the channel decoding. 

We will use the above result for the downlink of the MWRC 
in Sec. |V] On the downUnk, the relay transmits a function of 
the users' messages that it has decoded on the uplink. Each 
user i decodes the function sent by the relay from its received 
symbols and its own message Wi as side information. 

III. Upper Bounds to The Capacity Region and The 
Common-Rate Capacity 

In this section, we derive cut-set upper bounds to the 
capacity region and the common-rate capacity of the MWRC 
over a finite field. A cut-set upper bound to the capacity region 
of a network is the maximum rate that information can be 
transferred across a cut separating two disjoint sets of nodes, 
assuming that all nodes on each side of the cut can fully 
cooperate 133 page 591]. 

Theorem 2: Consider the i-user MWRC over a finite field 
F. If the rate tuple i?2, • ■ • , Rl) is achievable, then 

^min < l0g2 l-^l - H{N^) (15) 

Rl<\og^\F\-H{Ni), Vie{l,2,...,i}. (16) 

Proof of Theorem |2} Consider a network of m nodes, in 
which node i sends information at the rate Ri j to node j. If 
the set of rates } are achievable, there exists some joint 
probability distribution p{xi,X2, ■ ■ ■ ,Xm) such that the sum 
rate across a cut is constrained by ll37l Theorem 15.10.1] 



E 



R^.,<I{Xs■,Ys^Xs^), 



(17) 



for all S C {l,2,...,m}. Here Xs = {X, : i ^ S], and 
5^ = {l,2,...,m}\5. 

First, we consider the cut separating 5 = {l,2,...,L}\{i} 
for some i G {1,2,...,L}, and S'^ = {0,i}. 
The total information flow from S to iS"^ is 
(Wi,W2,...,Wi^i,Wi+i,...,WL) with the sum rate 
of ^f=i j^i ~ ^i- have the following rate constraint 
on R^, for each i G {1, 2, . . . , L}: 

Rt<I{Xs;YsAXs') (18a) 
^ H{Ys^\Xs^)- H{Ysc\Xs,Xs^) (18b) 
^ H{Yo,Y\Xo,X,) - H{Yo,Y\X^o.i....L}) (18c) 



H 



= H 



•No,N, I -H{No,N,) 

(18d) 

® iVo I + H{N,) ~ H{No) 



= H 



(SNo ~H{No), 



(18e) 



(18f) 



where (|18e| l is because ([® Xi] © A'o) and Ni are statis- 
tically independent, so are A'o and Ni. 

Now, we consider the cut separating 5 — {0,1,2,...,L}\ 
{i} for some i £ {1,2, and S'^ = {i}. 

The total information flow from S to S'^ is again 
(Wi,W2,...,Wi-i,Wi+i,...,WL) witii flie sum rate of 
R^. We have the following rate constraint on R^, for each 
{l,2,...,i}. 

R-<IiXs;Ys^\Xs^) (19a) 

= HiYs^\Xs^)- H{Ys^\Xs,Xs^) (19b) 

= H{Y\X,) - H{Y\X^„^^,„^L}) (19c) 

= H{{ho,i Xo) © N,) - H{Xo © ^»|X{o,i...,L}) 

(19d) 



H{{hojQXo)®N,)~H{N,). 



(19e) 



All achievable rate tuples must be bounded by the 
two constraints ( I18fb and (|19eb for all i and for some 
p{xt),xi, . . . ,xl)- Note that H{Ni), Vi, only depends on the 
respective noise distributions and does not depend on the 
choice of input distribution p{xo,xi, . . . , xl)- 

For any discrete random variable X G the maximum of 
H{X) is log2 and is attained by the uniform distribution 
ll37l Theorem 2.6 A]. For a random variable N G F and 
a constant h G \ {o}, from Lemma [1] there is a bijective 
(one-to-one and onto) mapping from X loY = [{h(I)X)(BN]. 
So, if p{x) is a uniform distribution, then for any N ^ n, 
p{y\n) is a uniform distribution. Averaged over all n, p{y) = 
^ni^jr p{y\n)p{n) is also a uniform distribution. So, choosing 
the independent and uniform distribution p(.To, xi, . . . , xl) = 
p^{xo)p^{xi) ■ ■ ■ p^{xl) simultaneously maximizes ( |18ft and 
(|19e| i for aU i G {0, 1, . . . , i}, giving 



Rl < log2 1^1 - H{No) 
Rl < log2 I-FI - H{N,), 



(20) 
(21) 



for all i G {1, 2, . . . , i}. Eqn. (|20] i can be further simplified 
to = inaXig{i^2,...,L} Rl < log2 \J^\-H{No). This gives 
Theorem |2] ■ 

For the common rate case, we have the following upper 
bound on the common-rate capacity: 

Corollary 1: Consider the L-user MWRC over a finite field 
The common-rate capacity is upper-bounded by 



C < 



L - 1 



loa 



I J"! - max H(N, 
ie{o,i, ■■■,-£-} 



(22) 



Proof of Corollary |7} Under the constraint R = Ri, 
Vi G {1, 2, . . . , L}, we have Rl^.^ = ^ {L ~ 1)R, Vi So, 
O and (O in Theorem [2] simplify to (i - l)iT! < log2 \J^\- 
i?(iVO, for i G {0,l,...,i}. ■ 
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IV. Fields and Linear Codes 

Random linear codes will be employed by the users to 
transmit their respective source messages to the relay in the 
FDF coding strategy. Using random linear codes, for any 
two messages the corresponding codewords are statistically 
independent, and the summation of these two codewords is 
also a codeword with the same structure and properties as 
the original codewords. With this, the relay will be able to 
decode the summation of two codewords to obtain the desired 
function of the source messages without needing to decode the 
individual messages. In this section, we present a construction 
of random linear codes with elements from finite fields, and 
prove in Theorem [3] that these codes achieve the capacity 
region of the finite field adder channel. 

Consider a message of the form s G -F*^, and a linear code 
that maps s to a length-n codeword x G J^": 



S0I 







\ 




92 




sQ 






\ 


9k. 


J 



9, 



(23a) 



(23b) 



where a; is a row vector of length n, s is a row vector of length 
A:, G is a fixed k-by-n matrix, with each element independently 
and uniformly chosen over J^, g^, the i-th row in G, is a row 
vector of length n, and q is a fixed row vector of length n, 
with each element independently and uniformly chosen over 
F. 

We will show that the codeletter of the above code is 
uniform i.i.d., and any two codewords are independent. We 
extend Gallager's results for binary linear codes 1381 pages 
206-207] to finite field linear codes in the following two 
lemmas. 

Lemma 4: Consider the linear codes defined in ( I23al ). Over 
the ensemble of codes, the probability that a message S\ is 
mapped to a given codeword X\ is p(a;i) = 1-^1 

Proof of Lemma^ There are [J^I^C^+i) ways of selecting 
G and q. As the elements are arbitrarily chosen, each (G, q) 
has a probability of iJ^j-^C^+i) of being selected. Following 
from Lemma [T] for any G, there is only one q that results in 
the given X\. So, there are only \T\^^ different (G, q)'s that 
map si to xx. Hence, p(a;i) = | j-|»fc|jr|-n(fe+i) = |jr|-«^ ^ 

Lemma 5: Consider the linear codes defined in ( I23ab . Let 
S\ and 82 be two different messages. The corresponding 
codewords, i.e.. 



Xx 

X2 



(si 0' 

(S2 0' 



9. 



(24) 
(25) 



are statistically independent. 

Proof of Lemma |5} To show independence, we need 
to find the probabilities p{xi) and p{x2\xi), and show that 
p{xi,X2) = p{xi)p{x2). Equivalently, we find the prob- 
abilities p{xi © ~X2) and p{xi\xi © -~X2), where —X2 
is the additive inverse of X2 in Let Si and S2 differ 
in the j-th position (they may differ, additionally, in other 
positions). So, Xi © —X2 = (si © — S2) G. For any 



{9i, ■ ■ ■ ■ • • ,9fc)- there is only one that results 

in the given {xi © -~X2)- Hence, there are only IJ^I"^^^^) 
different G's that give {xi © —X2)- In addition, for any chosen 
G that gives the required {xi © —X2), there is only one q 
that results in the given Xi. So, there are only IJ^I"^^"!) 
unique (G, <j')'s that give the desired {xi © —X2,xi) or 
equivalently the desired {xi,X2)- Again each (G, q) has a 
probability of | J^j-'^C^+i) of being selected. So, the probability 

piXi,X2) = IJ-I'^C^-I'IJ-I-'^C^+I) = ^ p{xi)p{x2). 

■ 

Remark 3: The key in proving Lemma |5] is to find the 
probability of the summation of the first codeword and the 
additive inverse of the second codeword, rather than the 
summation of the two codewords (as in the binary case 1381 
page 207]). Note that for the binary case, the additive inverse 
of a codeword is the codeword itself. 

Remark 4: Note that although the dither vector q is not 
required for proving that two codewords are independent 
(Lemma |5]l, it is required for proving that all codeletters 
for any codeword are independent and uniformly distributed 
(Lemma lU. 

Theorem 3: Consider a point-to-point finite field adder 
channel 

Y = X®N, (26) 

where X ^ T is the channel input from the transmitter, Y E T 
is the channel output received by the receiver, and N E T 
is the channel noise and is an i.i.d. random variable for each 
channel use. Using the linear code in ( I23al i. the source sends a 
message S, which is uniformly distributed in T'^, over n uses 
of the channel, X{S). The receiver can decode the message 
S from the n received signals Y with arbitrarily small error 
probability if n is sufficiently large and if 

/Cl0g2 1^1 



<log2\^\ -H{N). 



(27) 



Proof of Theorem \3} The source transmits X{S) = 
(S" G) © q, according to ( I23al i. over n channel uses. The 
receiver receives Y according to ( l26l l. It decodes = a if 
there is one and only one codeword X{a) that is jointly S- 
typical with the received signals, i.e., 

. (x{a),Y) eA^^y^,, and 

Without loss of generality, let S' = a be the message sent. 
The probability that the receiver makes an error in decoding 
is 

PeiTor ^MSt^ a} (28a) 
= Pr{(x(a),r) ^ AlxY]5 or (x(6),l^) G A^^y^s 
for some b 7^ a| (28b) 

<PT{[x{a),Y) ^AfxY]6} 
+ ^Pr{(x(6),Y) G^IW].}- (28c) 

From Lemma |2| we have 



Pr{(x(a),r)M[xyM}< 



S. 



(29) 
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For any b ^ a, from Lemma |4] we know that 
p{x{b)) ~ 111=1 (-^M)' from Lemma |5]we know that 
x{a) and x{b) are independent, and hence p{x{b),y) ~ 
Y[t=iP''{^[t])p{y[t])- So, from Lemma[3] we have 

Pr{(x(6), y) £ A'{^y^,} < 2-"(^"(^^^)-3*), (30) 

where P{X; Y) is evaluated with p{x, y) = p"'{x)p{y\x). Note 
that p{y\x) — p{n). 
This gives 

f'en-oi- < (5 + (I - i)2-"(^"(^;n-3*') (31a) 
<5 + 2"(^^^-[W)-3.])^ (31b) 

Choosing a sufficiently large n and a sufficiently small (5 > 0, 
if 

fclog2l-^l ^ju(^X;Y)-3S (32a) 
n 

= log2 1^1 - H{N) - 3(5, (32b) 

then PeiTor can be made as small as desired. 

So, if n is sufficiently large and if ^'^^ < log2 |~^| — 
H{N), then the receiver can decode S with an arbitrarily small 
error probability. ■ 

Remark 5: Consider a message w £ {1,2,..., 2"-^}, and 
choose an integer k such that 

fclog2 \T\ 



2nR < Ijrlfc 



(33) 



We can define an injective (one-to-one) function that maps 
each w G {1, 2, . . . , 2"'''} to a unique s G J-'^, and send s 
using the linear code (12 3 al l over n uses of the channel ( |26] |. 
For any i? that satisfies 

i?<log2|J-|-i/(7V), (34) 

we can always find sufficiently large k and n, such that 

fclog, \T\ 



R < 



< l0g2 1^1 - H(iV), 



(35) 



meaning that the receiver can reliably decode s, and it can 
then reverse the mapping from s to get the correct w. This 
means the rates in ( l34l i are achievable using linear codes. 
From ll37l pages 189-191], the channel ( l26b is symmetrical 
and its capacity is I(X; Y) evaluated with the uniform input 
distribution, i.e., r{X;Y) ^ log^ \T\ - H{N) bits/channel 
use. So, the random linear code defined in (12 3 al l can be used 
to achieve the capacity of the channel 



V. Achievable Rate Region of 
Functional-Decode-Forward 

In this section, we extend the FDF scheme developed in lITSl 
to MWRCs where the users are not constrained to transmitting 
at a common rate. Major differences are: (i) On the uplink, rate 
splitting is used, and (ii) On the downlink, joint source-channel 
decoding is used. Since rate splitting is used, we assume that 
the rates of all users, Ri, Vi G {1,2,...,L}, are rational 
number^ The reason for this will become apparent later. 

'Note that for the common-rate case, this is not required. 



We consider T message tuples. Each user i, i € 
{l,2,...,i}, sends T messages of nRi bits each, meaning 
that each user can transmit at a different rate. Denote the 
T messages of user i by (W^^l], Wi [2], Wi[T]) where 
Wi[t] G {1, 2, . . . , 2"^'} for all t. Since we consider full data 
exchange, user i needs to decode the messages sent by all 
the other users, i.e., | VF^ [t] : Vj G {1, 2, . . . , i} \ {i},\ft G 

{1,2,. ..,r}}. 

The message exchange among the users (via the relay) will 
be carried out in a total of (T+l) blocks of transmission. In the 
t-th block, for each t G {1,2, . . . , T}, each user i transmits (on 
the uplink) a codeword as a function of its t-th message Wi [t] . 
At the end of the t-th block, the relay decodes functions of 
its received signals in the t-th block. It then re-encodes these 
functions and transmits them (on the downlink) in the next 
block, i.e., the {t + l)-th block. At the end of the (t + 1)- 
th block, each user i then decodes the relay's transmission to 
obtain the t-th message of all other users, i.e., |vi^j[t] : j G 

{1, 2, . . . , L} \ {i}|. So, for each pair of the t-th block on the 
uplink and the (t + l)-th block on the downlink, if each user 
can reliably decode the t-th message of all other users, then 
repeating the same coding scheme for all t G {1,2,..., T}, at 
the end of (T+l) blocks, all users will have reliably decoded 
the messages sent by all users transmitted in the first T blocks. 

Let each block consist of n channel uses, i.e., the entire 
transmission utilizes a total of (T + l)n channel uses. Each 
user i transmits a total of TnRi bits in this transmission 
period. If each user can reliably decode the messages of all 
other users, then the rate tuple ( (^f^, (If^^ • • • . (TTTy^) 
is achievable. For any R2, . . . , Rl, and n, we can choose 
a sufficiently large T such that the achievable rate tuple is 
arbitrarily close to i?2, i?L). In this section, we derive 
constraints on R2, Rl such that the rate tuple is 
achievable. 

Since the encoding and decoding functions for all nodes are 
repeated in every block (different blocks for different message 
tuples), we focus on the first message tuple in Secs. lV-AllV-BI 
and IV-CI The relevant channel uses are the first block on the 
uplink and the second block on the downlink. For simplicity, 
we denote Wi[l\ by Wi in the these sections. 

A. On the Uplink 

Message Splitting and Mapping: 



Recall that i?^ = (E^^=i^jj - ^mm = 

mi%e{i,2 L}R] and Rl^-^ = (Et=i^j) " -^min- For the 



uplink of the MWRC, we use the idea of FDF in ||T6l com- 
bined with rate splitting. For each user i, i G {1, 2, . . . , L), 
we split its rate into 

Ri = Rmin + R'i, (36) 

where R[ > 0. So, each message Wi can be split into 

W, = {A,,B,), (37) 

where Ai G {1,2,..., 2"^™} is a random message of nR^i^ 
bits in length and B, G {1, 2, . . . , 2"-"^} is a random message 
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of ni?' bits in lengthQ. Let £>, < Z? < L, be the number of 
users whose message is strictly more than ni?niin bits. Let the 
set of these users be 



K,d2,-.-,rfi5} = ^? = {j:i?' >0}. 



(38) 



So, for all users i^V,Wi = A,, Bi = 0, and R'^ = 0. 

On the downlink, we will invoke the result in Theorem [l] 
where the relay sends messages each consisting of Ug i.i.d. 
random variables. To do this, we will further split each 
message into Ug parts, i.e., 

A, = {Af\Af\...,A^:'^^), Vze{l,2,...,i} (39) 



B, = {B 



(1) r(2) 



,Bf^), e{di,d2,...,dD}, 



(40) 



where all a["^ are independently and uniformly distributed in 
{1,2,..., 2"^"™/"='}, and all Bj^^ are independently and uni- 
formly distributed in {1, 2, ... , 2"-^^/"=}. All these messages 
will be transmitted using linear codes in T defined in (|23at . 
To do this, we define an injective function that maps each 
a e {1, 2, . . . , 2"^°™/"=} to a unique length-fcA finite field 
vector s{a) G J^'^*. This means the vector length kA must be 
chosen such that 



kATls l0g2 I J"| 



> Rrr 



(41a) 
(41b) 



This guarantees that a user can always reverse the function 
to get the correct A-"'' from S{A[ ). Similarly, for each 
j G V, we define an injective function that maps each 
I3j G {1, 2, . . . , 2"^i/"=} to a unique length-fcBj finite field 
vector s{bj) G J-^^'^ . So, fcs.j must be chosen such that 



(42) 



The length of the vector ^(7) and the corresponding mapping 

is clear from its argument 7 G {a,/3j}. 

Transmission: 

The block of n uplink channel uses are split into [L + D — l) 
sub-blocks. Each of the l-lh sub-blocks for 1 < I < L — 1 
consists of channel uses'*. Each of the /-th sub-blocks 



nR 



"^c -^+^ channel uses"*. 



for L<l<L + D— 1 consists of 
Note that if we sum the number of chaiiifiel uses in all sub 
blocks, we get 



(L-1) 



nR„ 



^ K ■ 

dev ™in 



mm 

n — = n. 

Kin 



(43) 

The first (L — 1) sub-blocks (of equal length) are used 
to send {Ai : i G {1, 2, . . . , L}}. The next D sub-blocks (of 
possibly different length) are used to send {Bj : j G T>}. 

In the Z-th sub-block for / G {1,2,..., L— 1}, only two users 
(more specifically, users / and {I + 1)) transmit, and the rest 
of the users do not transmit (which is defined as transmitting 

^Since and i?'^, Vi, ai'e rational numbers, we can choose a sufficiently 
large n such that nij„,i„ and nR'^, are integers. 



the additive identity 0). Define the transmission of user i in 
the sub-block as 



.(2) 



(44) 



The two active users transmit using hnear codes in T of the 
form defined in (I23ab. i.e.. 



X 



(„) _ /(5'(Af'^) Ga) © Qa,,, if ^ = / or / + 1 



V 



otherwise, 



(45) 



for all u G {1,2,..., Us}, where each S{A[^'^) is a row vector 
of length kA, Ga is a fixed fcA x nli'° matrijJl, each X-"^ 
and QA.i is a row vector of length , and o is the all-zero 

row vector Each element in the vectors/matrix is over 

For the next D sub-blocks, only users in V (those with 
an "extra" message Bi) transmit. We use the same notation 
in ( |44| ) for the transmitted symbols. More specifically, in the 
(L — 1 + m)-th sub-block for m G {1,2,..., D}, only one 
user, dm G V, transmits, and does so using a linear code of 
the form defined in ( I23al i. i.e.. 



otherwise, 



for all V G {1,2,..., n,,}, where S{B^^) is a row vector of 



length ks^d^, GB,d„ is a fixed ks.a 



matrix^, and 



each X^^'^ and is a fixed row vector of length . 

Each element in Ga, GB.d„,, Qai^ 9b d„ is indepen- 
dently and uniformly chosen over J^, is fixed for all trans- 
missions, and is made known to the relay. The transmission 
scheme above is summarized in Fig. |3] 
Decoding: 

In the l-th sub-block for / G {1, 2, . . . , L - 1}, the relay 

where F*"' = 



receives = {Y'^\y^^^ 



. AT^"' and 



X\%, = ([(/i,o 5(4"^)) © {hi+1,0 Ga) 

©(gAj©'ZA,;+i), (47) 



which is also a linear codeword of the form (I23ab . where the 
"message" is 

3(4%,) ^ {hi,o S{A'f^)) © (/.,+i,o G 

(48) 

From Theorem |3] if is sufficiently large and if 



^^^<log,|^|-i/(iVo), 



(49) 



then the relay can reliably decode S{A\"i^^), for all v G 
{l,2,...,n,J. 

In the {m + L - l)-th sub-block for m G {1,2,..., D}, 
only one user dm transmits at any time. The relay scales each 

^For any (possibly large) n^, we choose a much larger n such that — is 



sufficiently large, so that 



and all 



— ^^P- are integers. 
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All users transmit their respective Ai 
(of rate Rmi„) in pairs of fixed-size 
sub-blocks. 


Each user i with a data rate beyond 
(i.e., i e ©) gets an appropriately sized 
sub-block to transmit its remaining data B,. 


Sub-block 


1 


2 




L - 1 


L 


L + 2 




L-hD-l 


Transmission 


Xi{Ai) 


X^iAi) 
X-Ms) 




Xl-i{Al-i) 
Xl{Al) 


XdABd,) 






^dcl-Bdc) 








x'l' xf ... 
xf ... 


x[ 
xi 













Fig. 3. Uplink transmission 



of its received signals by q (the multiplicative inverse of 
hd^fi) to get 



(50) 



where = /i^^ (, iVo- Note that H{No) = H{Nq) as, for 
any fixed h^^ q ^ o, there is a bijective mapping between 
the two random variables (jid^ o © -^o^ and A^o- Applying 
Theorem [3] if ^"'^r is sufficiently large and if 

fcB,d,„ l0g2 1-7^ 



B. On the Downlink 

Now, assume that the relay decodes U in the first block of n 
uplink uses, it broadcasts this information in the second block 
of n downlink uses. For decoding on the downlink, each user 
i, i S {1, 2, . . . , L}, scales each of its received signals by h^l 
to get 

% = h-]QY,^Xo®N,, (58) 



where Ni = h^} N„ and H{Ni) = H{Ni). 

Note that each U^'^'^ is i.i.d., for all v € {1, 2, . . . , n^}. 



nR' 



<\og^\T\- H{No) ^log^lTl- H{No), so are S{A]y^^) for all v, and S'(Bf^) for all v. 

We use U, Si,i+i, and Si to denote the respec- 
ts 1) live generic random variables. Thus, we have U = 

With this, we can re-cast the downlink as a broadcast 



then the relay can reliably decode 5(5^"'') from I^q 
Define 



^dl + -^f)"'' for all w e {1, 2, . . . , n J. 



channel in which the relay broadcasts a message U = [(7^^-* 



c/("'^(5(4:^),5(4:i),...,5(4'ii.L) 



do I r 



and 



U^(C/(i\i7(2\...,C/("=)). 
On the uplink, if 

Rl,,„<\og^\F\-H{N^), 



(52) 
(53) 

(54) 



we can always find sufficiently large kp^, and 

{fcB,<i,„}d,„er>, such that 

i?™„ < i?™„^^^^?^ < log2 1-7^1 - H{N,) (55) 



DC ^ DC 

min — min 



kB,d^ns log2 \T\ 
nR' 



<\og^\T\-H{No), yd^eV, 



(56) 



to all the users, where each user i G V knows [S{B^ )]\/v 
(which is correlated with the message U) a priori. So, each 
user i gV can use its side information [S'(i3j-'''')]vu to decode 
U from its scaled received signals Yi during channel decoding 
(hence joint source-channel decoding). Note that all users do 
not need to use their respective Ai as side information for 
decoding U (see Remark |6l). From Theorem [T] all users can 
reliably decode U if Us and n are sufficiently large and if 

nsHiU\S,) < nIiXo;Yd, e V (59) 
UsHiU) < 72liXo;Yd, V2 ^ P, (60) 

for some p{xo). Note that S{b'>'"'') = 0ifi ^V. Choosing the 
uniform distribution for Xq, /(Xq; Yi) = log2 \J^\ — H{Ni) = 
log2 I-FI - H{N,), for all ^ e {1, 2, . . . , L}. 

(v) 

Since the mapping from (which is uniformly dis- 

tributed in {l,2,...,2"-^i/"=}) to 5(5*"^) is injective, we 
have, for all i £ 23, 



meaning that ( |41bt . ( |49] l and ( |42] |. ( fSTl i can be satisfied in 
their respective sub-blocks. So, if ( l54l i is satisfied and if ^ is 
sufficiently large, the relay can reliably decode U. 

Eqns. (|54] | and ( |55] | also mean that ^"^^ can be 

chosen arbitrarily close to Rmin, i.e.. 



Since S'i.i+i G J-'"^, we have 



HiS^.^+l) < fcAl0g2 1^1 



(61) 



(62) 



fcA"-s l0g2 1-7^1 



(57) 



with equality if and only if Si^i+i is uniformly distributed in 
J''^*. Note that each A^"^ 



where ry > can be chosen arbitrarily small. 



not imply that S{A^j^^,^) is uniformly distributed. 



Vz, being uniformly distributed does 
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This gives 
HiU) 



L-l 



= ^ (Si^.+i {Sj^j+i : for all j < i and j > 1} 

D 

+ i? (S'rf, {Sdt ■■ for alie<k and £ < l}, 

}) 



{5. 



1 < m < L - 1 



<(L-l)A:Alog2|-F| + 



(lev 



(i-l)-(i?n 



(-^min + C) , 



v) 



n 
n.. 



(63a) 
(63b) 

(63c) 
(63d) 



(lev 



J2R'd + {L- 1)77 



(63f) 



where 77 is defined in dSTl i. and ( = (i— 1)77 > can be chosen 
arbitrarily small. Here, (16 3 al l follows from the chain rule, and 
(I63bl i is because conditioning can only reduce entropy. 
It follows that for all i G V, 



H{U\S,) = H{U) + H{S,\U) - H{Si) 
= H{U) - H{S, 




(64a) 
(64b) 
(64c) 

(64d) 
(64e) 



where C > can be chosen arbitrarily small. Here, ( I64bl i is 
because H{S^\U) = 0. 

Note that for all i ^ V, R\ = 0, meaning Ri = Rmin, and 
hence R\ = R"^-^. Now, for all 7 e {1, 2, ... , L}, if 

i??<log2|^|-i/(iV0, (65) 



which is equivalent to 

R'i + ^ = l0g2 l-^l 



-H{N,), for some 7/' > 0, (66) 
I for (|63B and ( |64e] i so that and 



we can then choose ( 

(|60] | can both be satisfied, i.e., all users can reliably decode 
with sufficiently large 77^ and 77,. 

Note that on the downlink, linear codes are not required. 

Remark 6: Consider the two-user case (i.e., L = 2) where 
Ri = R.2 = Rmin- So, the two messages are Wi — Ai and 

c.f. |57} 

W2 = A2. Ideally, we choose fcA such that nRmia/ris ~ 



c.f. (62) 



we 

(i.) 



fcAlogalJ"! "■■■^ H{S{A[''1)). Since, U'^'"^ = S{A'fl) 
have ff(t7('')) = H{S{A[''1)) « fcAlogjlJ"!. Since A'^ 
and Aj"'' are uniformly distributed in {1, 2, . . . , 2"^™°/"= }, 
we have H{A[''^) = = nR,^in/ns. Given a[^'\ the 



only uncertainty left in (7^"-' is that of A2 ■ This means 

H{U^-^\A'f^) = H{A^^^) = 77i?,„„/77,, « fcAl0g2|^| « 

i?(C/('')). Similarly, we can show that iJ([7("V2"') ~ 
H{U^''^). So, each message, A J"' or ^2^'', individually con- 
veys very little information about U^^\ This explains why we 
do not lose optimality by not using Ai as side information 
when each user decodes U on the downlink. 



C. Decoding of Other Users' Messages 
Assume that every user i, for all i 



G {1,2,. ..,L}, 

correctly decodes U, i.e., U^''^ (^S {A^^l) , S {A^2,l) ^ 

. . . , 5(4"!,^ J, SiB^,l\ SiBil\ for all 7. £ 

{1,2,..., 77s}, sent by the relay. Since ( |42] | is true, user i can 



correctly decode sj"-* from S{B'"j"'), for all j G V. Recall 



that B 



{v) 



5, for all k ^V. 



Then user i performs the following: 



(63e) 5(<\) = (/77+\,o0 5(</+i)) 



S{A 



i+2/ 



iK+2,oQSiAf^,^,^2)) 
® -(V+2,0 /i^+i.o 



5(4")) 



^ (/7r,i © -(/ir.j /72,o 5(4"^)), 



-'iK\>QSiA[\^)) 

®-(/7^},0/lL_l,O0 5(4"li)) 

:(V-\.O0^(4^-\.) 

®-(/7-4o 0^0 5(4')) 

:(/7r4o0 5(A(!)2,_i) 
(B-{h-X,Qh,-i,oQSiA^J,)) 



(67a) 
(67b) 

(67c) 
(67d) 
(67e) 



(67f) 



to 



get {S{Ar),S{4^>),...,SiA^"J,lS{A^^,),. 



5(y4^"'')). Since (I41bb is true, user i can correctly decode A 



{v) 



from 5^(4"'). for all j G {1, 2, . . . , i} \ {i}. Repeating that 
for all V G {1, 2, . . . , 77^}, user i then obtains all other users' 
messages, i.e., = {Aj,Bj) : j G {1, 2, . . . , i} \ {ijj. 



D. Probability of Error 

In the above analyses, we focused on the first message 
tuple. Now, we consider all T message tuples. On the uplink, 
let the decoding error at the relay in the 7;-th fraction of 
the l-th sub-block of the i-th message tuple be Pe(0,i, 
fort G {l,2,...,r}, ? G {1,2,...,L + L> - 1}, and 
V G {1, 2, . . . , 77s}. On the downlink, let the decoding error at 
user i (of the message U sent by the relay) of the t-th message 
tuple be Pe{i,t), for i G {1, . . . , L} and t G {l,2,...,r}. 

For the t-th message tuple, from Section IV-AI if ^ is 
sufficiently large and if (l54b is satisfied, then Pe{0, t, I, v) < ei 
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for any ei > 0, for all I and v, meaning that the relay can 
reliably decode U. If the relay correctly decodes U (of the 
t-th message tuple) and transmits it on the downlink, from 
Section [V-BI with Ug and n sufficiently large and ( l65T l satisfied, 
all users can reliably decode U, i.e., Pf.{i,t) < €2 for any 
£2 > 0, for all i 6 {1,2,...,L}. 

Note that Pe{i, t) for the users, i.e., i 7^ 0, are found condi- 
tioned on the event that the relay has correctly decoded U (of 
the i-th message tuple in the previous block of transmission). 
When we calculate the end-to-end error probability, Pf., in the 
remaining of the section, we will show that the event that 
the relay wrongly decodes (or correctly decodes parts of) U 
can be made arbitrarily small (i.e., we do not assume that the 
relay correctly decodes U). Combining this with the fact that 
the probability that some users wrongly decode (or correctly 
decode parts of) U given the relay has correctly decoded U 
can also be made arbitrarily small, we can make Pg as small 
as desired. If the relay makes a decoding error, the error 
propagates onto the downUnk to the users. But we can make 
the probability of this event arbitrarily small. 

Now, if ( |54l i is satisfied, we have 

Pr{Relay makes some decoding error(s)} 

T L+D-1 ris 

< Pr |Relay wrongly decodes S'(A|"^j) or 



t=l i=l v=l 



*s sub-block for 
the i-th message tuple| (68a) 

= E E (68b) 

t=l 1 = 1 v=l 

< {L + D -l)Tn,€i, (68c) 
and so 



T L+D-1 ris 



Pr{Relay makes no error} > 1 - {L + D - l)rn^ei. (69) 

Conditioned on the event that the relay makes no decoding 
error, if ( |65] ) is satisfied, we have 



Pr I Some user(s) makes some decoding error(s) 
I Relay makes no error| 

L 

Pr I User i makes some decoding error(s) 

Relay makes no errorj (70a) 

<EE^e(M) 



4=1 



L T 



1=1 t=l 

< LTe2, 



(70b) 
(70c) 



and so 



Pr I No user makes any decoding error 

I Relay makes no error} > 1 — LTe2- (71) 



This gives 

Pr{No user makes any decoding error} 

>[1-{L + D -l)Tnse{\[l- LTe2i (72) 



and 



Pf. = PrjSome user(s) makes some error(s)} (73a) 

< 1 - [1 - (L + D - l)Tn,ei] [1 - LTea] (73b) 

<{L + D- l)Tn,ei + LTe2 - (L + D - l)LT^n,eie2, 

(73c) 

where ei ^ as ^ s- 00, and £2 — )• as tIs, 71 — > 00. The 

RHS of ( |73ct can be made arbitrarily small for any L, T, D 
(note that D < L), by choosing a sufficiently large Ug and 
much larger n, such that ^ is also sufficiendy large, making 
Pe arbitrarily small. 



E. The Capacity Region of the MWRC over a Finite Field 

The preceding analysis means that all rate tuples 
{Ri, R2, ■ ■ ■ , Rl) satisfying (|54] | and (l65T l are achievable. 
Comparing this achievable region with the capacity upper 
bound in Theorem |2] we have the following capacity theorem. 
Theorem 4: Consider the i-user MWRC over a finite field 
The capacity region is the set of all non-negative rate tuples 
{Ri,R2,...,Rl) satisfying 

Rl,,, < log, \T\ - H{No) (74) 
Rl<log2\:F\-H{N,), Vz G {l,2,...,i}. (75) 



Remark 7: Note that in the PDF coding strategy proposed 
above, each user's transmitted signals only depend on its 
message and do not depend on its received signals, i.e., 
Xi[t] ~ fi,t{Wi), \/i,t. Since this is sufficient to achieve the 
capacity region, the capacity region remains the same even if 
we consider the restricted MWRC where the users' transmitted 
signals can only depend on their respective messages and 
cannot depend on their received signals. This means utilizing 
feedback does not increase the capacity region of MWRCs 
over finite fields. 

Remark 8: The capacity region in Theorem |4] is equivalent 
to the set of all rate tuples (Pi, P2, . . . , Rl) satisfying 



Rl < log2 I-FI - m^x{H{No),HiN,)}, 



(76) 



for all i e {1,2,...,P}. 

Now, we show that the capacity region in Remark |8] 
denoted by TZ, is convex and hence the convex hull 
operation is not required. Let two rate tuples be 



iR['\4'\...,Rl'>),{RY',R'2 R'l') e n. For 



?(2) n(2) 



(2)n 



any a G [0,1], define {R];',R^ 



(3) n(3) 



,P 



(3)n 



such that 



p: 



(3) 



aPj^^' + (1 — a)R\''' , Vi. For this rate tuple, and for 



?(2 
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(77a) 



all i G {1,2, . . . , L}, we have 

i?f)^A^^(3)_^(3) 
L 

= Y^iaRf^ + (1 - a)Rf) - + (1 - a)i?f 

(77b) 



^ + (1 - a)i?f )^ 

< log2 1^1 - HiNo), 

where (I77db follows from ( f76] l. 
From (I77cl l and ( |76] |, we get 



i?f )^ < log2 1^1 - H{N,] 

?(3) n(3) n(3)N 



(77c) 
(77d) 



(78) 



So, the rate tuple % i?^ % . . . , R^^') e U, meaning that U 
is convex. 

F. The Common-Rate Capacity of the MWRC over a Finite 
Field 

Consider the common-rate case where all users transmit at 
the same rate, i.e., Ri = Rmin, for all i G {1,2,..., L}. We 
have Wi = Ai and Bi = 0, for all i, i.e., rate splitting is not 
required. So, using FDF, on the uplink, only the first (L — 1) 
sub-blocks are required for each message tuple for the users to 
transmit their respective Wi in pairs. On the downlink, since 
Bi = for all i, the users do not need to use their own 
message in decoding U (c.f. (|59]l-(l6Q]l), i.e., joint decoding is 
not required. The users only utilize their respective messages 
in steps (|67at -( |67f| i after they have decoded U. FDF without 
rate splitting and separate source-channel decoding achieves 
the common-rate capacity, stated in the following corollary. 

Corollary 2: Consider the L-user MWRC over a finite field 
F. The common-rate capacity is 



C = 



1 



L - 1 



log, \T\ 



max H(Ni 
ie{o,i....,L} 



(79) 



Proof of Corollary^ For the common-rate case, Ri = R, 
yi e {1,2,..., L} and we have R"^^ = R\ ^ {L - l)R, 
From Theorem ID all non-negative rate tuples {R, R, . . . ,R) 
satisfying 

{L-l)R<log^\T\-H{N.,), \/te{0,l,...,L}, (80) 



are 



achievable. 



So, 



common 



rates up to 
(log2 \T\ — niaxjgjo 1 H{Ni)^ /{L — 1) are achievable. 
From CoroUary [1] we know that this is a capacity upper 
bound. ■ 



VI. A Case Study: The Binary Two-Way Relay 
Channel 

In this section, we study the special case of the binary 
TWRC to illustrate the role of rate-splitting and joint source- 
channel decoding in achieving the capacity region. In the 
notation of this paper, we study the case where L = 2, 
T = {0,1} = J^2, © and are addition and multiplication 
in modulo-two respectively. By definition, /ii o = /12.0 = 



^0,1 = ^0,2 = 1, since they cannot be zero. For the binary 
TWRC, the noise variables A^o. ^1- and A^2 are each binary, 
and we can define pi G [0, 1] such that pi = Pr{iVi = 1} and 
E{pi) = H{Ni) = - pi\og^pi~{\~ pi)\og^{\- Pi). Without 
loss of generality, we consider pi G [0, ^] for all i G {0, 1, 2}. 
Although the capacity region of the binary TWRC has been 
reported in ifTOl . 1(121 . we use this example to highlight the 
components of our scheme and to compare FDF with the 
complete-decode-forward (CDF) strategy. 

A. Functional-Decode-Forward with Rate Splitting and Joint 
Source-Channel Decoding 

From Theorem m FDF with rate splitting and joint source- 
channel decoding achieves all non-negative rate pairs {Ri, R2) 
satisfying 



Ri,R2 <l-H{po) 
Ri<l- H{p2) 
R2 <1-H{pi), 

whose closure gives the capacity region. 



(81) 
(82) 
(83) 



B. Functional-Decode-Forward with Rate Splitting and Sepa- 
rate Source-Channel Decoding 

Now, we find the achievable rate region using FDF with 
rate splitting but with separate source-channel decoding. 

The coding on the uplink is the same as that in Sec. lV-Al i.e., 
using linear codes, functional decoding and rate splitting. First, 
we assume that R2 > Ri, and hence Wi = Ai and W2 = 
{A2,B2). So, on the uplink, from dSli, if i?2 < 1 - H{po), 
then the relay can reUably decode ([5(Ai"^)]v„, [5(5^"^ )]v.). 

Now, instead of using the joint source-channel decoding 
for the downlink described in Sec. IV-BI we will use separate 
source-channel decoding in the sense that the users do not 
use their own messages in channel decoding. We re-cast the 
downlink as a broadcast channel with degraded message sets 
Il39] , where a source broadcasts a common message to two des- 
tinations and a private message to one of the destinations, and 
where both the destinations do not know the messages a priori. 
Applying this to the downlink of the binary TWRC, we have 
the relay sending [5'(A^"2)]vu to both users, and [S^B^^)]-^^, 
to user 1, and the users do not use their own messages in the 
channel decoding of [S{A^^I)]yv and [S{B^^^) 



V'U- 



Recall that [S'(ylj"2)]vD is an ni?i-bit message and 
[S{B^2^)]yv an nR'^-m message. From ||39l, if i?i < 1 - 
iJ(/3(l-p2) + (l-/3)p2), R'2 < i?(/3(l-pi) + (l-/3)pi) - 
H{pi), and Ri+R'^<l- H{pi), for some < /3 < i, then 
both the users can reliably decode [S^(A^^2)]vu and user 1 
can reliably decode [S{B^'^)]\/y purely from their respective 
received signals Yi. Of course, after decoding [S'(A^^2)]vt) 
and [S{B^'')]\/v (for user 1), the users must follow the steps 



in ( I67al )-( l67fl ) to obtain the other user's message. But as far 
as channel decoding on the downlink is concerned, the users' 
own messages are not used (as side information). 

Combining the rate constraints on the uplink and on the 
downlink, we have the following achievable rate region: 
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Theorem 5: Consider the two-user MWRC over J^2- FDF 
with rate splitting and separate source-channel decoding 
achieves the convex hull of TZi and 7?.2, where 

• TZi is the set of all non-negative rate pairs {Ri,Ri +R2) 
satisfying 

i?i < 1 - H{I3{1 - P2) + (1 - P)p2) (84) 
i?^ <i7(/3(l-/7i) + (l-/3)pi) (85) 
Ri+R'2<l- m(ix{H{po),H{pi)}, (86) 

for some < /3 < i. 

• Tl2 is the set of all non-negative rate pairs (i?2 + ^'1, ^2) 
satisfying 

R2<-H{a{l-pi) + il-a)pi) (87) 
R[ < H{a{l - p2) + (1 - a)p2) - H{p2) (88) 
R2+R'i < l-m&x{H{po),H{p2)}, (89) 

for some < a < ^. 

Proof of Theorem^ TZi follows directly from the above- 
mentioned rate constraints. 7^2 is obtained by reversing the 
role of users 1 and 2 for the case Ri > i?2. Using time 
sharing, the convex hull of TZi and TZ2 is achievable. ■ 

Remark 9: We can show that when pi < p2, TZ-i Q Til', 
and vice versa. Hence, for any channel setting, it is sufficient 
to consider only one region in Theorem |5] 

Now, we show that FDF with rate splitting and separate 
source-channel decoding achieves the capacity region of the 
binary TWRC under certain conditions. 

Lemma 6: Consider the two-user MWRC over 7^2- If 

1) Pa > max{pi,p2}, or 

2) pi = P2, 

then FDF with rate splitting and separate source-channel 
decoding achieves the capacity region. 

Proof of Lemma^ First, consider the case pi < p2, i.e., 
H{p2) > H{pi). If 

po>P2^ H{po) > H{p2), (90) 

we have 

l-H{po)<l-H{p2)<l-H{pi). (91) 

Then by setting /3 = 0, i.e., i?2 = 0' ^1 Theorem|5]becomes 

{{Ri,R2):0<RuR2<l-H{po)}. (92) 

The closure of the above region coincides with the capacity 
region since (ISTT i implies ( |82] | and ( |83] | when ( |9T] l is true. 

Similarly, for the case of p2 < pi, if po > Pi, then the 
closure of 7^2 (with q = 0) in Theorem |5] coincides with the 
capacity region. 

Next, consider the case pi = p2, i.e., H{pi) = H{p2). By 
setting /? = 0, i.e., R'2 = 0, TZi in Theorem |5] becomes 

{{R1.R2) : < i?, < 1 - -ff(pi),0 < R,<1- Hipo), 

fori = 1,2}, (93) 

whose closure also coincides with the capacity region. ■ 



C. Complete-Decode-Forward 

Using CDF, the relay fully decodes both Wi (of nRi bits) 
and W2 (of ??i?2 bits) on the uplink, which is a multiple-access 
channel. So, if 

i?i < 1 - Hipo) (94) 
i?2 < 1 - H{po) (95) 
i?i+i?2 < l-i/(po), (96) 

then the relay can rehably decode Wi and W2 EOl, gll. Note 
that implies and ( |95] l. 

Assuming that the relay has successfully decoded Wi and 
W2, it broadcasts (VFi,M^2) on the downlink. Using joint 
source-channel decoding, each user i, i E {1, 2}, can reliably 
decode the other user's message from their respective received 
signals Yi and their own messages Wi if ll42ll . ||43]| 

i?i < 1 - Hip2) (97) 
i?2 < 1 - H{pi). (98) 

Combining the uplink and the downlink constraints, the 
achievable rate region using CDF is given by the following 
theorem: 

Theorem 6: Consider the two-user MWRC over 7^2- CDF 
achieves all non-negative rate pairs (i?i,i?2) satisfying 

i?i < 1 - H{p2) (99) 

R2<l-H{pi) (100) 

i?i+i?2 < l-H(po). (101) 

CDF achieves the capacity region under the following 
conditions. 

Lemma 7: Consider the two-user MWRC over 7^2- If 

Hipo)<H{p,)+Hip2)-l, (102) 

then CDF achieves the capacity region. 
Proof of Lemma [7} 

Hipo) <Hip,) + H{p2)-l (103) 
^ l-H{po)>l^H{pi) + l-Hip2) (104) 
^ H{pi) > Hipo) and i/(p2) > Hipo). (105) 

From ( |104t , we know that conditions ( |99] l and dlOOl ) imply 
dlOll ). In this case, CDF achieves the following rate region 

{(i?i,i?2) :0<i?i <l-Hip2),0<R2 <l-Hipi)}, 

(106) 

whose closure is the capacity region since ( I105l l, ( l82b and ( |83] ) 
imply dlB- ■ 

D. Numerical Calculations and Discussion 

We denote FDF with rate splitting and joint source-channel 
decoding by FDF-RS (joint), and FDF with rate splitting and 
separate source-channel decoding by FDF-RS (separate) for 
the discussion in this section. 

In Fig. m we compare FDF-RS (joint), FDF-RS (separate), 
and CDF for the following channel parameters: po = 0.1, pi ~ 
0.05, and p2 = 0.2. In this example, the FDF-RS (separate) 
achieves a rate region strictly larger than that of CDF, but both 
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Fig. 4. Rate region comparison for l-H(po) = 0.531, l-H{p\) = 0.714, 
and 1 - H{p2) = 0.278 



regions are strictly smaller than the capacity region which is 
achievable by FDF-RS (joint). 

In Fig. |5] we fix po ~ 0.25 and plot the range of pi 
and p2 for which the capacity region is achieved by FDF- 
RS (separate) or CDF. The top-right corner corresponds to a 
noisier downlink (pi,p2 > Po) , while the bottom-left corner 
to a noisier uplink (po > pi,p2)- 

For the capacity region in Sec. IVI-AI we refer to the 
constraints dSTT l as the uplink constraints on the capacity 
region, and (l82]i-(l83Tl the downlink constraints on the capacity 
region. 

Using CDF, the relay needs to fully decode the users' 
messages on the uplink, and this restricts the sum rate to be 
constrained by the uplink, c.f. dlOlb . When the uplink is noisy 
and is the channel bottleneck, the capacity region is effectively 
constrained by the uplink constraint (ISTT l. which is strictly 
more relaxed than (llOll i. So, CDF is not uplink optimized. 

However, when the downlink is noisy such that H{po) < 
H{pi) + H{p2) — 1, the capacity region is effectively con- 
strained by the downlink constraints ([82]i-(|83]l. which is 
achievable by CDF, as shown in Lemma|7]and plotted in Fig.|5] 
We say that CDF is downlink optimized. 

Using FDF-RS (separate), the users' a priori knowledge 
about their own messages is not utilized during the channel 
decoding on the downlink - their own messages are used only 
after channel decoding. So, FDF with separate source-channel 
decoding is not downlink optimized. This is why when the 
downlink is noisy (pi > po or p2 > po), FDF-RS (separate) 
fails to achieve the capacity region. An exception is when 
pi = p2, i.e., the downlink is symmetrical, in this case, the 
equal rate point (common rate) marks a vertex of the capacity 
region and from Corollary [2] we know that FDF with separate 
source-channel decoding achieves the common-rate capacity. 

On the uplink, FDF-RS (separate) performs functional de- 
coding at the relay and is able to achieve the uplink constraint 
on the capacity region. As shown in Lemma |6] and plotted in 
Fig. [5] when the uplink is the channel bottleneck, FDF-RS 
(separate) achieves the capacity region. 

From Fig. |5] we see that using both CDF and FDF-RS 
(separate) does not cover the capacity region for all channel 
settings. On the other hand, FDF-RS (joint) is both uplink and 
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Fig. 5. This figure shows the regions of channel parameters (pi,P2) for 
which the capacity region for po = 0.25 is achieved by CDF and FDF-RS 
(separate). The capacity region for all (pi,P2) can be achieved by FDF-RS 
(joint). 



downlink optimized, and it achieves the capacity region for all 
channel settings. 

VII. Conclusion 

We have proposed a functional-decode-forward (FDF) cod- 
ing strategy with rate splitting and joint source-channel decod- 
ing that achieves the capacity region of the multi-way relay 
channel (MWRC) over finite fields. For the special case where 
all users transmit at the same rate, our proposed FDF achieves 
the common-rate capacity of MWRCs over finite fields without 
requiring rate splitting or joint source-channel decoding. 

Using the two-user binary MWRC as an example, we 
showed that both FDF with rate splitting and separate source- 
channel decoding (denoted by FDF-RS (separate) in Figs. |4] 
and Is), and complete-decode-forward (CDF) fail to achieve 
the capacity region of the MWRC as (i) for the former, users' 
messages are not utilized for channel decoding on the down- 
link and (ii) for the latter, the relay is constrained to decoding 
all users' messages. We noted that the shortcoming of CDF 
corresponds to the strength of FDF with rate splitting and 
separate source-channel decoding, and vice versa. However, 
as seen from Fig. |5] even considering both strategies does not 
cover the capacity region for all noise distributions. 

Our proposed FDF with rate splitting and joint source- 
channel decoding overcomes these shortcomings by having 
the relay decode only functions of the source messages on 
the uplink, and having the users utilize their own messages 
in channel decoding on the downlink. This strategy indeed 
achieves the capacity regions of MWRCs over finite fields 
for all noise distributions. Our proposed coding strategy can 
be applied to the general multi-source multi-destination multi- 
relay network, where the relays facilitate data exchange among 
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different source-destination pairs, but are themselves not re- 
quired to decode the source messages. 
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